SEQUENCE LISTING 



<110> benoit, Patrick 

SCHWARTZ, Bert rand 
BRANELLEC, Didier 
CHIEN, Kenneth R. 

<120> SEQUENCES UPSTREAM OF THE CARP GENE, VECTORS CONTAINING 
THEM AND USES THEREOF 

<130> 03806.0530-00000 

<140> 

<141> 2001-12-07 

!::;<i6o> 5 

C <170> Paten tin ver. 2.1 

S<210> 1 
UJ<211> 2358 
UJ<212> DNA 
%1<213> Mus musculus 



U<400> 1 



Pljggatcctttc 

Q 

jggctttgcca 


atgtttaaca 


atatcaaccc 


taacccaagg 


ggaacagcct 


gcctgacagt 


60 


cccatgaata 


cttcctagtc 


tagtccgttt 


gtgaaactca 


gcccatccca 


120 


gacacttctgc 


aagccccatc 


ctctacaagg 


tgctcattgg gaatttcctg 


gagcttctct 


180 


ttcaggatca 


gcctgattct 


agggcagcag 


ttctcaacct 


gggggcctcg 


acccctttgg 


240 


gggaatcaaa 


cgacccttta 


caggggtcac 


atatcatcta 


tcctatatgt 


caggtattta 


300 


cattacgatt 


cgtaacagta 


gcaaaattac 


aggtatgaaa tagcaatgaa 


ataattttat 


360 


gattgaaggt 


caccacaaca 


tgaggccgcc 


acactgttct 


agagaaaaat 


cacctgggtg 


420 


gggaaaggtt 


tgggaaagcc 


tttctgtcca 


ttcttcattc 


ttcaaagtga 


tgtgttcaca 480 


gaaagccttt 


cagctgttct 


gctggggctc 


ttagtaagtc tgagtaggaa 


ctgtatgtac 


540 


caggtctgct 


tcttatgggt 


ggagccaaga 


cgcatcgtgg 


gtggagcgaa 


gacgcaacct 


600 


caccttctag 


ctctgcatcc 


atagcaagta 


gcctaatgtt 


tctgtgtcta 


ggtgtcatct 


660 


ctgtgaatcg 


agatccttgg 


ccttgcttga 


attagggagg 


cacaaaatac 


tcagagattc 


720 



1 



" aagactgctc agcagcccag agtccttcct 
ttagctctga gtcaggcctg gaacaaacgg 
cttgtcactt caagaggtca aagaaaatag 
ttatccattg atagcgtctc aggacagata 
cacgaaggac aaggtattag tgtgttggtt 
agaaacacaa tttgctggtt gaacagctga 
tggaagggtg agtgaggaga gacagatata 
ccctaattaa cacttccctc ttctactgac 
taaaaaaagt attttatgtg gctcttacga 
taaatattta tatttttcac attttaatat 
Jjttttgcctct ctcaacagca aagcttgggg 
Qcgagagcccc gtgtatctag gcagatgctc 

Wfacgcacattt ttctcgggct ctcttaagct 

Ul 

ujagcatccaag ttgtgcttct gctaagaact 
/cccgttttgg ggtagatcct ctgattagcc 
fygcactaatta tggccagtga caccatagag 
llctcctaatgc tggtacgatg gcatgtcaca 
Jagaattccaa acagatagag acaagtggca 
ttatccccag aaataggatg tcccaaagca 
gtccagttat cagaaagata tggctgtaag 
acgttagtca tatgagagct gacaaagaag 
acaggcagct gtcccctggc ttcccgatac 
gtcactgcca aaggaatgac cctctcacat 
cttgtcatct ccctcttggg cttcccagac 
ctgaattggc cactggtggg ggcaggggtg 
tcacccagcc ctagctatat aacgggctgg 
gggttcatcc acaagagaga aaaacataga 



caaaggaaag gtctcaactc tcagcccccc 780 
ccacaggaat gagaaaagct gccatagctg 840 
tgttaaccat gaaaacgaga agaccaacag 900 
ggacagagag aacactagga gaggggaacc 960 
ttcagggcaa tgtcttgtac tgaagattct 1020 
agtggggtgg gggttcttac cccatgttca 1080 
tgatggccag cataacaaac atacacaaca 1140 
acccccttca ctctcctctt tcataaaaaa 1200 
tagaatcttt cctcgaacta taaaaagatc 1260 
cttagcgatg acaagccaga aacaagtatt 1320 
cctttttgtt tccgtgttag gaatagaaca 1380 
tatcattagc ccatgagtct ccagcctcag 1440 
tttcccacag cattgggaaa ctttactgac 1500 
ggactcacat ctctctgtgc atcacttcgg 1560 
ttcagattta gaacacggtg agcctgtggt 1620 
tcaaagtgca ttactgaatg ctttcaattt 1680 
gggccatttt agctgcagac atcactccag 1740 
cccagaccca tctccttccc ctcgggctga 1800 
acacttccca gccaactgga gtgctgataa 1860 
tgtgatgcac agtgcttgca ttttcttgat 1920 
gaaaaagagc agcgatgtgg tgcaatatta 1980 
gtgggatgac tcgcattgct gagcggtgtg 2040 
ttcttcctga ttcgcatacg ccgcggccag 2100 
actaagtctg gaatgaaaat tcacctgcct 2160 
tgacttggct tcccaggctg gaagattatc 2220 
tgtggagggg ctccacaggg ccagttccag 2280 
ctcgaggtct agggagcttg catgcctgca 2340 



ggtcggaggc caccatgg 2358 

<210> 2 

<211> 2074 

<212> DNA 

<213> Homo sapiens 

<400> 2 

ctgcagcaag ttacttaatg ttttttgcct cagcatcctc tctgtaaaat gagagcatta 60 
gtcttgctcc aacttcgagg gcatggacag ctctgggatt tcatatccaa gacccttaaa 120 
catcccacag tccttccccc aaacacttct cctcctaata cctccctcag tttgggtcag 180 
gcctggaaca aaaaggcata cgaaatggta gaaaaagtgt ccatgactac ttctgactta 240 
h= gatgaagaga ccaatgaaaa tagtaatgac tctgtttgct tcagcaggac atatactaaa 300 
| ataggagcta tacaaagaag attagcatgg actctgtgca agaatgacac acaaatttgt 360 
jjj gaaacattcc atatattaaa aataaataaa taataaagag aaaaggaaaa aattaaaaag 420 
jjj aaaatagtga tagctgtgtc catctcaaag aaaagcccag gagatttcct ttatttaccc 480 
" ; cctttaagat agaatattag gagaccggaa catatgatac aggaggtact gggagggtcc 540 
Jctctttgtca atgttttgtc ttggggtggg gagtcgatgt cttctcaaag tttcagaaac 600 
Haccatccact gactgagcat tcaaggggca agaggagaat ggcagccaca tttgttgatt 660 
□gggtgagttt ggggagaaat agacacacaa aggtcaaaca taacttccta attaacactt 720 
ccctccattc acaattccct tctcccattc ttctctcctg tcttttacts akaraaaccc 780 
agtttttcct gaaactataa aaataccccc agtatgttta cataatttac acctcaaaga 840 
ttagaaacca gaaatagaga ccttttcaac ccttccggaa gcaaagtgca ttatccctcc 900 
agccacgtgt ctcaaatctt gatgcatcag aatcatctgg gtgctttkaa attcaagatg 960 
attcctacga gttaccataa atcaactcag aattccctgg agtggggcca gggatctgta 1020 
tttctgacaa gctcccacag gtgattcctt tccccacagc atttgagaac ttcagctcaa 1080 
tgacctaatc agagtcctgc cattgctaat atctggtctc atttttbtca tatatatata 1140 
tagtatttgt ggtagagatg ggattttgcc atgttgccca ggctagtatt gaactcctaa 1200 
gctaagcaat cttcctgtct ctgcctccca aaatgttggg attacaggtg taagccactg 1260 

3 



cacccggctg atagctggtt tcatttactc tatttcttga ccactctgat ccattttgaa 1320 
gtaaaaatgc tccaattatt atgctgtttt agaacacggt aagcatgtca tgtgctaatg 1380 
gccagtgaca tcataaaaga aaagtgcatt actgaatgct ttcaatgtct tataatgatg 1440 
gtaaggtggc atgtcatggg gcctatttag cccagacatc actccaaaga attccaaaca 1500 
gatatagaca agtgccttta gggcccagat cccttcccct caggctgttt acccagggaa 1560 
taggatgtcc tgggacaagt ttcccctaag tgaagtgttg ataagtctgc ttatcagaaa 1620 
gatattactg ggggtgtgat atgtagggca tctacatttt cttgataggt agtcatatga 1680 
aagctgacaa agaaaaaaag ggcagtgatg tggtgcaatg tcaacagaca gctgtcccct 1740 
gactcttgac aaataggatg acttgcattg ctgagcgatg tgatcaccac caaaggaatg 1800 
gccctctcac atttcttcct gattcacata ttcagcaggg ttagcttgtc ctcccctccc 1860 
-jtcttcagctt cccagacact gagtctggaa tgaaaattca cctgcctctg agttggctcc 1920 
|taatgggggc gggagtgtta cttcggttcc caggttggaa gattatctca cccggcccca 1980 
jjgctatataag ctgaccggtg tggaggggcc cagcagggcc aactccaggg attccttcca 2040 

M 

ycgacagaaaa acatacaaga ctccttcagc caac 2074 



:;<2io> 3 

,:j<211> 750 
;:<212> PRT 
: ^<213> Homo sapiens 

=l <400> 3 

Met Gly Glu Thr Leu Gly Asp Ser Pro He Asp Pro Glu Ser Asp Ser 
1 5 10 

Phe Thr Asp Thr Leu Ser Ala Asn He Ser Gin Glu Met Thr Met Val 
20 25 30 

Asp Thr Glu Met Pro Phe Trp Pro Thr Asn Phe Gly lie Ser Ser Val 
35 40 45 

Asp Leu Ser Val Met Glu Asp His Ser His Ser Phe Asp lie Lys Pro 
50 55 60 

Phe Thr Thr Val Asp Phe Ser Ser lie Ser Thr Pro His Tyr Glu Asd 
65 70 75 80 

lie Pro Phe Thr Arg Thr Asp Pro Val Val Ala Asp Tyr Lys Tyr Asp 

85 90 95 

4 



Leu Lys Leu Gin Glu Tyr Gin Ser Ala lie Lys val Glu Pro Ala Ser 

100 105 110 

Pro Pro Tyr Tyr ser Glu Lys Thr Gin Leu Tyr Asn Lys Pro His Glu 

115 120 125 

Glu Pro Ser Asn Ser Leu Met Ala lie Glu Cys Arg Val Cys Gly Asp 

130 135 140 

Lys Ala Ser Gly Phe His Tyr Gly val His Ala Cys Glu Gly Cys Lys 

145 150 155 ' 160 

Gly Phe Phe Arg Arg Thr lie Arg Leu Lys Leu lie Tyr Asp Arg Cys 

165 " 170 175 

Asp Leu Asn Cys Arg lie His Lys Lys Ser Arg Asn Lys Cys Gin Tyr 

180 " ' 185 ~ " 190 

Cys Arg Phe Gin Lys Cys Leu Ala Val Gly Met Ser His Asn Ala lie 

195 200 ' 205 



nArg Phe Gly Arg Met Pro Gin Ala Glu Lys Glu Lys Leu Leu Ala Glu 
210 215 " 220 



■4 5 



id lie Ser Ser Asp lie Asp Gin Leu Asn Pro Glu Ser Ala Asp Leu Arg 

T;225 230 235 240 



Ala Leu Ala Lys His Leu Tyr Asp ser Tyr lie Lys ser Phe Pro Leu 

245 ' 250 255 



!Thr Lys Ala Lys Ala Arg Ala lie Leu Thr Gly Lys Thr Thr Asp Lys 
f 260 ~ 265 270 

l 

L Ser Pro Phe Val lie Tyr Asp Met Asn ser Leu Met Met Gly Glu Asp 
275 280 285 

Lys lie Lys Phe Lys His lie Thr Pro Leu Gin Glu Gin ser Lys Glu 
290 295 300 

Val Ala lie Arg lie Phe Gin Gly Cys Gin Phe Arg ser Val Glu Ala 
305 310 315 ~ 320 

val Gin Glu lie Thr Glu Tyr Ala Lys ser lie Pro Gly Phe Val Asn 

325 330 335 

Leu Asp Leu Asn Asp Gin Val Thr Leu Leu Lys Tyr Gly Val His Glu 
340 345 350 

He lie Tyr Thr Met Leu Ala ser Leu Met Asn Lys Asp Gly val Leu 
355 360 365 

lie ser Glu Gly Gin Gly Phe Met Thr Arg Glu Phe Leu Lys Ser Leu 

5 



370 



375 



380 



Arg Lys Pro Phe 
385 

Lys Phe Asn Ala 



Ala val 
Pro lie 



Leu Lys 
450 



lie lie 
420 

Glu Asp 
435 



Gly Asp Phe Met Glu Pro Lys 
390 395 

Leu Glu Leu Asp Asp ser Asp 
405 410 

Leu ser Gly Asp Arg Pro Gly 

425 

lie Gin Asp Asn Leu Leu Gin 
440 



Leu Asn His Pro Glu Ser Ser Gin Leu 

455 



Met Thr 
uLeu Gin val lie 



Gin Lys 
465 



=Xeu Gin 



lys Thr 



Glu He 
500 

Thr Asp 
515 



Met Met 
530 



Gly Glu 
iGlu Gin Ser Lys 
val Glu 



1545 

s 
VI 

i Arg Ser 



Pro Gly Phe Val 
580 

Tyr Gly val His 
595 

Lys Asp Gly val 
610 

Phe Leu Lys Ser 
625 

Phe Glu Phe Ala 



Asp Leu Arg Gin lie val Thr 
470 ~ 475 

Lys Lys Thr Glu Thr Asp Met 
485 490 

Tyr Lys Asp Leu Tyr Ala Trp 

505 

Lys Ser Pro Phe Val lie Tyr 
520 

Asp Lys lie Lys Phe Lys His 
535 

Glu Val Ala lie Arg lie Phe 
550 * 555 

Ala val Gin Glu lie Thr Glu 
565 570 

Asn Leu Asp Leu Asn Asp Gin 

585 

Glu lie lie Tyr Thr Met Leu 
600 

Leu lie Ser Glu Gly Gin Gly 
615 

Leu Arg Lys Pro Phe Gly Asp 
630 635 

Val Lys Phe Asn Ala Leu Glu 
645 650 



Phe Glu Phe Ala Val 

400 

Leu Ala lie Phe lie 
415 

Leu Leu Asn Val Lys 
430 

Ala Leu Glu Leu Gin 
445 

Phe Ala Lys Leu Leu 
460 

Glu His val Gin Leu 

480 

Ser Leu His Pro Leu 
495 

Ala lie Leu Thr Gly 
510 

Asp Met Asn Ser Leu 
525 

lie Thr Pro Leu Gin 
540 

Gin Gly Cys Gin Phe 

560 

Tyr Ala Lys Ser lie 
575 

val Thr Leu Leu Lys 
590 

Ala Ser Leu Met Asn 
605 

Phe Met Thr Arg Glu 
620 

Phe Met Glu Pro Lys 

640 

Leu Asp Asp Ser Asp 
655 



Leu Ala He Phe He Ala val He lie Leu ser Gly Asp Arq Pro Gly 
660 665 670 

Leu Leu Asn Val Lys Pro lie Glu Asp lie Gin Asp Asn Leu Leu Gin 
675 680 685 

Ala Leu Glu Leu Gin Leu Lys Leu Asn His Pro Glu Ser Ser Gin Leu 
690 695 700 

Phe Ala Lys Leu Leu Gin Lys Met Thr Asp Leu Arg Gin He Val Thr 
705 710 715 720 

Glu His val Gin Leu Leu Gin val lie Lys Lys Thr Glu Thr Asp Met 

725 730 735 

Ser Leu His Pro Leu Leu Gin Glu lie Tyr Lys Asp Leu Tyr 
740 745 ' 750 

.,,<210> 4 
- <211> 30 
S<212> DNA 

g<213> Artificial sequence 

!'[]<400> 4 

JGGCGATTTAA ATAATGTAGT CTTATGCAAT 30 



-<210> 5 

rli<211> 31 
□<212> DNA 

^l<213> Artificial sequence 



hK400> 5 

GGGGTCTAGA AGGTGCACAC CAATGTGGTG A 31 
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